Translation initiation start prediction in human cDNAs with high accuracy
نویسنده
چکیده
MOTIVATION Correct identification of the Translation Initiation Start (TIS) in cDNA sequences is an important issue for genome annotation. The aim of this work is to improve upon current methods and provide a performance guaranteed prediction. METHODS This is achieved by using two modules, one sensitive to the conserved motif and the other sensitive to the coding/non-coding potential around the start codon. Both modules are based on Artificial Neural Networks (ANNs). By applying the simplified method of the ribosome scanning model, the algorithm starts a linear search at the beginning of the coding ORF and stops once the combination of the two modules predicts a positive score. RESULTS According to the results of the test group, 94% of the TIS were correctly predicted. A confident decision is obtained through the use of the Las Vegas algorithm idea. The incorporation of this algorithm leads to a highly accurate recognition of the TIS in human cDNAs for 60% of the cases. AVAILABILITY The program is available upon request from the author.
منابع مشابه
PreTIS: A Tool to Predict Non-canonical 5’ UTR Translational Initiation Sites in Human and Mouse
Translation of mRNA sequences into proteins typically starts at an AUG triplet. In rare cases, translation may also start at alternative non-AUG codons located in the annotated 5' UTR which leads to an increased regulatory complexity. Since ribosome profiling detects translational start sites at the nucleotide level, the properties of these start sites can then be used for the statistical evalu...
متن کاملAccuracy improvement for identifying translation initiation sites in microbial genomes
MOTIVATION At present the computational gene identification methods in microbial genomes have a high prediction accuracy of verified translation termination site (3' end), but a much lower accuracy of the translation initiation site (TIS, 5' end). The latter is important to the analysis and the understanding of the putative protein of a gene and the regulatory machinery of the translation. Impr...
متن کاملGeneMarkS: a self-training method for prediction of gene starts in microbial genomes. Implications for finding sequence motifs in regulatory regions.
Improving the accuracy of prediction of gene starts is one of a few remaining open problems in computer prediction of prokaryotic genes. Its difficulty is caused by the absence of relatively strong sequence patterns identifying true translation initiation sites. In the current paper we show that the accuracy of gene start prediction can be improved by combining models of protein-coding and non-...
متن کاملWhy is start codon selection so precise in eukaryotes?
Translation generally initiates with the AUG codon. While initiation at GUG and UUG is permitted in prokaryotes (Archaea and Bacteria), cases of CUG initiation were recently reported in human cells. The varying stringency in translation initiation between eukaryotic and prokaryotic domains largely stems from a fundamental problem for the ribosome in recognizing a codon at the peptidyl-tRNA bind...
متن کاملFour Ia invariant chain forms derive from a single gene by alternate splicing and alternate initiation of transcription/translation
We determined the structural basis for the presence of electrophoretically-distinct, antigenically-related forms of invariant chains in Ia oligomers, and established the mechanisms by which they can be expressed from a single gene. S1 nuclease protection assays indicated that, in B cells, transcription of this gene initiates at a minimum of three sites. Thus, unlike previously thought, invarian...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Bioinformatics
دوره 18 2 شماره
صفحات -
تاریخ انتشار 2002